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ABSTRACT 



For periodic review inventory models with stochastic 
demand, the idea of stock-out risk Is defined, from which 
the importance of accurate prediction of demand is deduced. 
Methods of demand model parameter estimation are investigated 
and several methods compared on the basis of theoretical 
soundness, ease of application, and accuracy of estimates 
based upon the results of extensive computer simulation. The 
theoretical development of maximum likelihood and exponential 
smoothing estimators as applied to prediction is presented 
along with the development of a new Bayesian approach to the 
problem of demand forecasting. 
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INTRODUCTION 



I . 

There is a great deal of interest today in business and 
industry, as well as government activities, in inventory con- 
trol. The current literature of operations research contains 
considerable ideas, methods, mathematical models and algo- 
rithms addressing this particular subject area. One of the 
most difficult problems involved in the application of any 
mathematical model to actual inventory analysis or control 
problems is the estimation of the parameters used in the for- 
mulas of the models and algorithms. To resolve this problem 
operations researchers have concerned themselves with the de- 
velopment of accurate estimation techniques and demand fore- 
casting methods. One of the techniques which has received a 
good deal of attention in the current literature is exponen- 
tial smoothing, a method of estimation and forecasting 
developed by Brown [Ref. 2] . The basic idea of the smoothing 
approach is to use a weighted moving average of demand data in 
such a way as to exponentially discount the oldest data in the 
demand sequence and place most of the weight for an estimate 
on the most recent data available. This exponential discount- 
ing of past demands is controlled by the selection of a smooth- 
ing constant which determines the relative value of the weight 
assigned to the data in the estimate. The exponential smoothing 
technique has been questioned on theoretical grounds by Zehna 
[Ref. 10] and others. It has been suggested that the classical 
maximum likelihood methods will produce better results in a 
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number of demand models and some simulation results are avail- 
able which indicate that this is the. case [Refs. 6 and 11]. 

The purpose of this thesis is to examine the problem of 
forecasting demand and explore several alternative methods 
which should be considered as possible forecasting techniques 
for particular demand models. A simulation analysis of max- 
imum likelihood, smoothing, and Bayesian estimation methods 
was conducted in order to ascertain important or noteworthy 
properties of the techniques when applied to normally distrib- 
uted random demand in the constant mean and linear demand 
models . 

Throughout this report the idea of a risk involved in in- 
ventory operations plays a central role. The demand models 
studied are assumed to be applied to an inventory system in 
which stock is controlled by means of a periodic review of 
inventory levels. That is, at certain specified periods of 
time, monthly, quarterly, etc., the demand for items is deter- 
mined along with the current inventory balance. These data 
are used to requisition replacement stock to insure that the 
system does not run out of stock too often. If the demand v/ere 
deterministic there would be no problem in deciding how much 
to order. However, the demand in this case is assumed random 
and predictions of the demand in future periods must be made 
in order to set reorder levels. It is assumed that resupply 
is instantaneous so that, if it can be determined at the end 
of a period that within a specified probability the demand in 
the next period will be less than a certain quantity, then the 
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reorder le\el is that expected upper limit on the next 
period's demand. The reorder quantity is the reorder level 
less the amount of stock on hand. By deciding what probabil- 
ity of running out of stock, called stock-out risk, is 
acceptable, the decision maker sets a risk level that deter- 
mines the ordering policy. Consequently estimating parame- 
ters and predicting demand in general is a central issue in 
the determination of reasonable reorder policies. 

In Section II. the theoretical development of the maximum 
likelihood estimation technique is presented along with' a 
simplified version which was used by Zehna [Ref. 11] for 
comparison with exponential smoothing. Section III. is a 
summary of the theoretical basis for exponential smoothing 
developed by Brown [Ref. 2] . In addition, at the end of Sec- 
tion III smoothing is applied to a case not previously con- 
sidered by Brown. Section IV presents a comparison of maxi- 
mum likelihood and smoothing with consideration given to 
theoretical basis, ease of application, and simulation re- 
sults. In Section V a Bayesian procedure for demand estima- 
tion and prediction is developed for the case when the 
variance of the demand distribution is known. The resulting 
Bayes estimation method is compared to exponential smoothing 
on the basis of computer simulation. 

Since simulation was used throughout this study a special 
section on this technique is presented at the beginning of 
Section IV. It should be noted that in ^previous cases 
studied by Zehna [Ref. 11) and Ornek [Ref. 6], simulation was 
also used as one method of comparing MLE and smoothing. There 
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is a basic difference in the approach in those papers and 
the approach taken in this study. Their procedure was to 
generate random demand over a long period of time, 1000 
periods, rasking estimates with the various methods at each 
period, and determining the sample risk (probability of 
running out of stock) over the entire time of operation. 

The approach taken here, however, realizing that in many ap- 
plications only a limited amount of data may be available or 
used as a basis for estimation, is to generate demand data, 
make estimates, and determine the risks involved at various 
points in time, say 5 periods, 20 periods, etc., over a large 
number of replications of the experiment. It is felt that 
the results of this simulation method should prove useful in 
determining which estimating techniques perform well when 
estimates are based upon a fairly limited amount of data. 
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II. MAXIMUM LIKELIHOOD ESTIMATION 



A classical approach to forecasting or predicting the 
value of a random variable based upon past observations is 
to develop a functional relationship between the observed 
values of the random variable and some control variable us- 
ing regression analysis . Applied to the present circumstances 
one may view the random variable Y as demand and the control 
variable x as time. Demand is observed at various times 

x, , x„, •••, x and then inferences are made about the de- 

1' 2' ' n 

pendence of Y on x. 

In this section the results of the General Linear Hypoth- 
esis are examined as they apply to forecasting demand when 
the mean demand is a linear function of time, that is, 

E (Y^) = a + bx^ . 



A. GENERAL LINEAR MODEL 

1. Estimate of the Coefficients 



If Y = AB + e, where A = 



X 1 



X, 



n 



, and 6 = 



then Y. = a + bx. + e. where e . % N(0, o ) , i=l , 2 , • • • ,n 



It 



follows that E(Y^) = a + bx^ and V(Y^) = o , i=l,2,***,n . 

It can be shown, see Zehna [Ref. 12], that the maximum like- 



lihood estimates of 3 and o are: 
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3 = Q -1 A' Y and c 



(Y - A P) (Y - A 6 ) 



n 



respectively, where Q = A’ A. It is further true that , 
the random vector corresponding to is (3^, a Q ) where 
indicates a bivariate normal distribution. Algebraically 
then , 



Q = 



n 



nx 



n 



— v* 2 

nx ) x . 

i=i 1 



, and Q 



-1 



ns 



x 



n 9 
V - 2 



x . -nx 



i=l 



-nx 



n 



2 n — 2 n 
where s = 1/n £ (x. - x) and x - 1/n £ x. 

x i=l 1 i=l 1 



Then 



ji = Q 1 A’Y = 1/ns 2 



n 2 



y ) x z ~ x y x. y . 

1 . L T 3. • 1 1"* 1 

1=1 1=1 



n 



l (X. - x) (y . - y) 
i=l 



The above reduces to 



b = 



n 

l x i Y i “ nxy 
i-1 

£ 2 -2 
) x, - nx 

i=i 1 



( 2 - 1 ) 



a = y - xb 



( 2 - 2 ) 



2. Variance of the Estimators 



Since is N 2 ($., 0 2 Q 1 ) it follows that: 
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E(8) = 6 f V (a) 



2 v 2 

o ) x . 
i=l 1 



n 



n 



-x 2 



, v(b) = 



a 



5! (X. - X) 



n 



i=l 



I (x. - X) 



-X 2 



i=l 



( 2 - 3 ) 



A A 

A A 



and COV(a, b) = 



-c 2 x 



n 



l (X. - x) 

i=l 1 



These quantities depend only upon the total number of periods 
of demand observed and the variance of the underlying distri- 
bution in the model. The relative sizes of these variances 
are indicated in the following table. 



TABLE I 

VARIANCE OF a and b 



a 


ft 2 

V(a)/o 


1 ^ 

| D>> 

\ 

NJ 


COV (a, b)/a 2 


10 


0.4667 


0.0121 


- 0.0667 


50 


0.0825 


0.0001 


- 0.0025 


100 


0.0406 


0.0000 


- 0.0006 


1000 


0.0040 


0.0000 


- o:oooi 



The maximum likelihood estimate of o is 



n 



= 1/n l (y - 0 
i=l 1 1 



3 2 x i) 



It is well known that 

■ 2 



nu . 2 

■ - - — v 

2 K n-2 
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so that 



A 

~ 2 

— _ is an unbiased, estimator for a ^ . 
n-2 



3 . Prediction 

The main purpose of using estimation in inventory 
applications is to forecast demand some future period of 
time. Most often it is necessary to observe demand in peri- 
ods 1, 2, •••, n, and utilizing these data predict what de- 

mand will be in the next period in order to set a reorder 
level such that within tolerable risk limits the system will 
not run out of stock. 

The demand y at time x ^ x, , • • • , x , may be 

J o o 1 n 

considered as a value of the random variable Y which is 

o 

2 

distributed N (a + bx , o ) . Then 



Y = 
— o 




% 



N (A 6,0' I) 
n+1 — o — 



where A = 
— o 



1 x 



o 



. It can be shown that 





and is independent of no . It follows that 



Y - a - bx^ 
o o 



1 + 1/n + 



lx - *o> 

n - 2 

.1 < x i - X) 
1=1 



no 



o (n - 2) 
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has a t-dis tribution with n-2 degrees of freedom. Thus if 
a reorder level R is set by means of 



(x - x ) 

R = a + bx + t „ , S / 1 + 1/n + ° 

o n-2 , 1-a y • x / n 



(2-4) 



M 



l (X. - x)“ 

i=l 



where a and b are as defined previously, 



/ n 

I p A A 

l “ a “ bx 2 ) 

i=l 



y « x 



n 






, and t „ . m is the 
n-2, 1-a 



(l-a)th percentile of the t-distribution with n-2 degrees 
of freedom, the probability that the demand in period x q will 
be less than or equal to R is 1-a. This assumes that a risk, 
of |l00 a|% is acceptable. 

It should be noted that the length of this one-sided upper 
prediction limit for Y q increases as the period for which the 
forecast is made differs in time from x. This fact should 
discourage "Extrapolation" too far above the midpoint of the 
sampling interval. However, the forecast should be good for 
the (n+1) st period. 



B. CONSTANT MEAN MODEL 

The constant mean model may be considered as a special 
case of the linear model above with b = 0. For this model, 

Y. = a + e. where e. ^ N(0, c 2 ) i=l,2,“*,n 

1 1 X ' t i r 

1 . Estimates of the Coefficient and Variance 

Utilizing the same procedure as before it is found 



that 



n 



= Vn l y. = y , 

i=l 1 



(2-5) 



and 



2 



E ( a ) = a , V ( a ) = a /n (2-6) 

~ ? n 
o = 1/n l (y. - y) 

i=l 1 

2 . Prediction 

A 

Again it can be shown that Y - a has a 

2 

N(0, o [1 1/n]) distribution and is independent of no" 

*2 

n ^ 2 

— has ay „ distribution so that 
„2 A n -2 



Y - a 
o 



y 1 + l,7n 



is distributed as t 



*2 

no 



n-1 ’ 



o (n-1) 



It follows that setting a reorder level by means of 



A A 

R = a + t 



it S \l 1 + l/rT 
n-1, l- a y\J 



(2-7) 




n-1 

will yield the desired risk a . 



C. LINEAR MODEL WITH Y-INTERCEPT = 0 

In many cases it might be reasonable to assume that there 
is no minimum initial demand imposed on the system, i.e., at 
time x = 0 , Y = 0 , and that demand grows linearly with time . 
This may be expressed as 

2 

Y^ = bx^ + where 'v* N(0, a ) i=l,2,»»»,n. 
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In matrix notation 



Y = Ab + e where A 



x. 



x 



n 



1. Estimate of Coefficient and Variance 



Proceeding as before, 
n ? 

q = A 1 A = l xf Q 

i=l 1 



n 



- 1 =■ i/ i x 2 
iA 1 



n 

1 x • y • 

b = Q _1 A'Y = 1 1 



n 






( 2 - 8 ) 



/v 2-1 

V (b) = a Q 



a 



V 2 
) x - 

i=l 1 



n 



a = (Y - Ab) ' (Y - Ab) 



1/n l (y. 

i=l 1 



E (b) = b 



bx. ) 

l 



(2-9) 



2. Prediction 



As in the previous case it can be shown that 

2 

£ „ x 

Y - bx has a N(0, cs [1 + — — ]) distribution, 

o o ' n „ ' 



l x 

i=l 1 



n a . 2 

_2 1S X n-1 ' 



and further that Y q - bx Q and no are independent. 
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Thus 



Y - bx 
o 



o 




^ t , 
n-1 




I a 2 (n-1) 



The appropriate reorder level to obtain a risk a is 




( 2 - 10 ) 



where S 



y • x 




and t , , is the (l-a)th precentile of the t-distributi :>n 

n-1, I-a 



with n-1 degrees of freedom. 

D. CONSTANT MEAN-SIMPLIFIED PROCEDURE 

A more easily applied technique for setting the reorder 

level in the constant mean case was demonstrated by Zehna 

[Ref. 11] and Ornek [Ref. 6] with very good results. The 

procedure was used only as an attempt to compare the accuracy 

of maximum likelihood estimation with that of estimating 

parameters using exponential smoothing. 

Since demand is normally distributed with mean y and 
2 

variance o there is, of course, a theoretical reorder level 
y + Ko such that 



Pr (Y < y + Ka) = 1 - a 



( 2 - 11 ) 
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where K is the (1 - a) th percentile of the standard normal 
random variable — N(0, 1) . 

Hence maximum likelihood methods may be used to estimate 
y and o separately and then these estimates combined as in 
Eq. (2-11) to set the reorder level. In this way. 



y 



and 



n 



1/n l y. = y 
i=l x 



o = 




n - 1 



R = y + Ka . 



( 2 - 12 ) 



This method of course fails to consider the actual distri- 

A A 

butxons of y and a so that the true risk so obtained is prob- 
ably not exactly a, but for many purposes the ease of 
application may make the tradeoff worthwhile. 

It is possible to make statements about the expected 
value and variance of this estimate of reorder level as the 

A A 

distributions of y and a are well known, and the correspond- 
ing random variables are further known to be independent. 

/n £ 2 

From Eq . (2-6) , E(y) = y and V(y)=a/n. It is shown 

in Hald [Ref. 5] that a has a Chi-distribution and in 
particular 

E(3) 4 ‘‘n'V 1 " n ' ° WhSre a n = Yr?T 

= ~\f 1 “ 2 ( n - TT 

E( a) =Y ^ a a 
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n - 1 ,2 2 . .2 , 2 • 1 

v(0) , — — b n 0 where b n = 1 - a n = r(7r ^- 1T 



/V. • 



1 2 

V (a) = =— • a 

2n n->°° 



>• 0 



Y 



Thus E(R) = y + K / 1 - 



, K + 2 2 

and V (R) = — ^ o 



2n 



n^co 



2n n~>°° 
-> 0 . 



-> y + Ka 
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III. EXPONENTIAL SMOOTHING 



As indicated in the introduction, exponential smoothing 
is a method of forecasting discrete time series, in this 
case future; demand, based upon past observations of the sys- 
tem. The major source of information on the development and 
application of exponential smoothing is the book by Robert 
G. Brown [Ref. 2], In fact. Brown introduced the concept and 
the very name exponential smoothing in an earlier publication. 
Unless otherwise indicated all of the following discussion of 
the methodology and theory may be found in the reference 
noted above . 

A. EXPONENTIAL SMOOTHING OF PAST OBSERVATIONS 

The basic idea behind exponential smoothing is to assign 
weights to past observations so as to place the most weight 
upon recent observations and less and less weight on the 
older pieces of demand data, thereby limiting the effect on 
the estimate of "dated" information. This may be accomplished 
by selecting a smoothing constant 0 < a < 1 which is exactly 
the weight placed on the current observation. The smoothing 
operator is given by 



S t (Y) 



t-1 . 

a l e x y 




i=0 



(3-1) 
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where £ = 1 - a and y^ y 2 , ♦ •*, y are the observations of 

demand in the first t periods, and y o is some initial demand 

often taken to be zero. For application in this thesis it 

will be assumed that y • = 0. 

o 

Y - S (Y) then is the exponentially smoothed estimate of 
demand in che (t + 1) st period based upon t periods of 
observation . 

Brown claims that exponential smoothing may be utilized 
effectively to accurately estimate the coefficients in vari- 
ous models of discrete time series such as demand or sales 
forecasts. The idea certainly has intuitive appeal and seems 
to be widely used as a forecasting technique. 

For applications of exponential smoothing the number of 
coefficients to be estimated in a model is referred to as the 
degrees of freedom of the model. Corresponding to each degree 
of freedom is an order of exponential smoothing defined by 
applying the smoothing operator recursively. For single 
smoothing, as first order smoothing is called, Eq. (3-1) may 
be written, 

S t (Y) = ay t + BS t _ 1 (Y) ' (3-2) 

where ^(Y) is the current smoothed estimate prior to 
observing y^. Nth order smoothing may then be defined as 

s£(Y) = S[s£ _1 (Y)] = as£ -1 (Y) + 3S^_ 1 (Y) 

where S°(Y) is defined to be S (Y) . As a special case, 
double smoothing becomes 

S^(Y) = aS fc (Y) + 3S^_ 1 (Y) (3-3) 
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B. CONSTANT MEAN MODEL - SINGLE SMOOTHING 

Assuming that demand Y is distributed normally with mean 
2 

y and variance 0 then 

Y i = H + e i where ~ N(0, a 2 ) i=l,2,***,t . 

Since there is but one coefficient to be estimated, single 
smoothing is called for to estimate y. After t periods have 
been observed the exponentially smoothed estimate of the mean 
of the demand distribution is given by 



y = S t (Y) 



“Yt + 3S t-l (Y) 



t-1 

a l 3 1 y 
i=0 



t-i * 



(3- 



3 } 



Note that E(y) = aE(Y) £ B"*’ = y(l ~ 3^) , where y is the ran - 

i=0 

^ 1 i in ~ 

dom variable associated with u, so that, , E(y) = y. Thus, 

“ t->-00 

y is asymptotically unbiased. Also, 



V ( y) 



2 2 
= a a 



t-1 



1=0 



2i 



2 2 
a a 



1 ~ 3 
■ 1+3 




(3-4) 



and V(y) = q 2 ( < a non-zero quantity. 

In attempting to estimate the underlying variance of the 
demand distribution (not considered a coefficient in the 
model). Brown chooses to use mean absolute deviation, MAD for 
short, a measure of variability which has questionable theo- 
retical basis and properties [Ref. 10] . MAD is defined as 

A = E( | Y - y | ) . (3-5) 



It is well known that when Y is normally distributed the 
ratio of A to a is ~\j , about 0.79 8. This relationship does 

not hold however in several other important probability 



21 



distributions . 



To estimate o Brown first estimates A by 
smoothing forecast errors defined as 

e ( t) = y fc - S t _ 1 (Y) , (3-6) 

the forecast error in period t. The smoothed estimate for 
A is given by 

t-1 . 

A t - «.l eVt-il 

i=0 1 

or in operator form 

A t = a|e t | + 0 A t-1 . .(3-7) 

Then the smoothed estimate of o is given by 

o = YY ^ 0 - 8 ) 

implicitly assuming the smoothed estimate for A enjoys the 
same invariance principle known to be true for maximum like- 
lihood estimates. 

It has been clearly pointed out by Zehna [Ref. 11] and 
others that such a procedure as that described above is sta- 
tistically suspect. Several attempts have been made to de- 
velop the distribution theory necessary to make theoretical 
judgments concerning A and the invariance principle implicit 
in obtaining a but so far the mathematics has proven 
intractible [Ref. 6], 

Disregarding the statistical questions involved, Brown 
combines the smoothed estimates of y and a as in Eq . (2-12) 

to obtain a smoothed estimate of reorder level. In this way, 

R = y + KO (3-9) 
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where K is the appropriate percentile of the N(0, 1) 
distribution. 

The initial condition S q (Y) is either zero or some best 
guess as tc what the value of y will be prior to making any 
observations. The effect of this initial value is negli- 
gible after 30-45 periods for . 1 <_ « <_ . 9 

Some remarks may be made concerning the expected value 
and variance of R, the random variable corresponding to R, 



E (a) 




E(S) 



~ t-1 . 

E(A) = a l p X E ( | e 
i=0 ' 



) : 



E ( | e fc _^ |) is shown in [Ref. 11] to be o~\j ^ -- 2 - 



a 



2 

IT 



so that 



E (a) = (1 - 3 t )a 



t->00 



■> a , 



( 3-1C ) 



indicating that a is asymptotically unbiased if all the 
underlying assumptions are valid. 

As stated earlier, no analytical results have been ob- 
tained concerning the distribution of A and the problems 
involved extend into the intractibi lity of obtaining the exact 
variance of a. However, Ornek [Ref. 6] was able to develop a 

method for approximating V(a) numerically. V(c) is dependent 
2 

upon both a and the value of the smoothing constant a. That 
is, 

V(o) = c 2 a 2 (3-11) 



where C may be found in the following table. 
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TABLE II 



cr VARIANCE FACTORS 



a 


C 




t 


1 


0.10 


0 .1745 


t 


> 


80 


0.15 


0.2173 


t 


> 


so; 


0.20 


0.2553 


t 


> 


35: 


0.25 


0.2905 


t 


> 


30 


0 . 30 


0 . 3239 


t 


> 


25 1 


0 . 35 


0 . 3561 


t 


> 


20 


0 .40 


0 .3876 


t 


> 


15 



The values of C were calculated by Ornek [Ref. 6] and the 
number of periods necessary for a good approximation were 
determined in the course of computer simulation and testing 
of the model . 

Using the results of Eqs . (3-3) - (3-11) the expected 

value of R may be written 

E(R) = (1 - g fc ) [y + Kcr) y + Ka , 

and the variance of M may be approximated as 

V (R) = [ ( r jp ~ g) (1 - 3 2t ) + C 2 K 2 ] a 2 (3-12) 

assuming that y and 5 are independent. Again, the variance 
of R is non-zero even for large t. 
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C. LINEAR MODEL — DOUBLE SMOOTHING 

If the demand in period t is of the form 

= a + bt + e t where e a, N(0, a 2 ) 

then E ( Y^) = a + bi and V(Y i ) = a 2 , i=l,2,*».,t 

1 . Estimate of the Coefficient s 

At time t, the coefficients are estimated by a com- 
bination o:; both single and double smoothing and the demand 
at time t + t is estimated as 

^t+x a t + ^t T 

where t is the number of periods in the lead time. The esti- 
mates of the coefficients developed by Brown are 



a t = 2S t ( Y) - S 2 (Y) 



(3-13) 



b t = a/3 t S t ( Y ) - S 2 (Y) ] 

2 

where S fc (Y) and S fc (Y) are given by Eqs . (3-2) and (3-3) . It 

is important to note that a^_ is not an estimate of the para- 
meter a in the model but rather an estimate of the current 
point on the theoretical line at time t. Thus Y^_ + ^ = 
a^. + b^_T is equal to the estimate of the current theoretical 
demand increased by the estimated slope multiplied by the 
number of periods in the lead time. Usually x will be equal 
to one and the forecast will be for the next period only. 

2 . Initial Conditions 

The initial conditions for the linear model are 
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(3-14) 



S o (Y) ~ a o “ (3 /a) b 0 



S o (Y > = - 2 «/a;b 0 

where a Q and b Q are initial guesses as to the values of the 
coefficients a and b. Again these Initial conditions are 
transient and have no effect on the estimate after 30-45 
periods . 

3 . Variance of the Estimators 

Using multivariate analysis Brown claims the follow- 
ing asymptotic formulas for the variances of a and 6 in the 
linear case, 



v(I t ) 



a(l + 43 + 5B 2 ) a 2 = Aa 2 

(1 + 3) 3 



V(6 t ) = 

COV(I t/ 



2a' 



(1 + 3) 



2 2 
a = Bcr 



5 t> 



a (1 + 3 3) 
(1 + 3) 3 



Co 



2 



where representative values of A, B, and C are presented in 
Table III. 

4 . Setting the Reorder Level 

As in the previous case the smoothing estimates of 
the mean, p = a^ + b^x and a as defined in Eq. (3-8) may be 
combined in the form 

R.j. = pj. + KcJj. (3-15) 

where K is chosen to obtain the desired risk. In this model 
A is obtained from Eq. (3-7) by smoothing the successive 
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TABLE III 



VARIANCE OF SMOOTHING COEFFICIENTS 



a 


A 


B 


C 


0.10 


.12611 


.00029 


.00539 


0.15 


.18982 


. 0010 7 


.01262 


0 . 20 


.25377 


.00274 


.02332 


0.25 


. 31778 


.00583 


.03790 


0.30 


. 38164 


.01099 


.05679 


0 . 35 


.44508 


.01909 


.08045 


o 

o 


.50781 


.03125 


. 10938 



absolute differences between the estimated point on the line, 
P t , and the actual demand observed in period t. 

D. LINEAR MODEL - Y INTERCEPT = 0 

When the Y - intercept is known to be zero the model is 
given by 

2 

Y i = bx i + e i where ^ N ( 0 , a ), i=l,2,***,t . 

Brown does not consider the model as a special case so no 
smoothed estimate for b is given in [Ref. 2] . Following the 
general smoothing methodology, it is assumed that only single 
smoothing is necessary to estimate the single coefficient in 
the model. At successive periods of time x^, x fc 

the corresponding demand y^, y^ is observed. A 

natural point estimate of the slope at time x^. would be 

E k = y k /x k 
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An estimate of b can then be obtained by smoothing the suc- 
cessive observations of obtaining 

b fc = S t (B) = a(y t /x t ) + 3S t _ 1 (B) . 

Since E (Y^) = bx^ , = b. It follows that 

t-1 . 

E[S (B)]= a l B 1 E(Y ./x. .) = (1 - 3 )b 

i=0 r 1 

and lim E(b ) = b which implies that this smoothed estimator 

t-H» r 

for b is asymptotically unbiased. 

The estimate of a is obtained as in the previous two 
models using MAD. In Eqs. (3-7) and (3-8) the forecast error 
for this procedure may be defined as 
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IV . COMPARISON OF MLE AND SMOOTHING 



In this section the procedures developed in the previous 
tv;o sections are compared chiefly by means of computer simu- 
lation. Before detailing the analysis some general comments 
on the technique of simulation are in order. 

A. COMPUTER SIMULATION 

1 . General Comment s 

Computer simulation is primarily utilized to study 
systems or procedures which cannot be adequately analyzed by 
means of mathematical analysis or hand calculations . Some 
advantages of simulation are the rapidity of calculations, 
magnitude of data which can be generated and analyzed, and 
the ease with which sensitivity analysis may be conducted. 

One of the main disadvantages is that the results obtained by 
simulation are known to be typical of that particular case 
only and in many cases may not be accurately extended to oth- 
er sets of circumstances with any assurance that the results 
are true in general. Also, the results of a simulation study 
may not be at all applicable to an actual situation if the 
model developed for the study does not accurately represent 
the system being analyzed. 

2 . Use of Random Numbers 

Random numbers generated by a computer routine are 
actually pseudo-random numbers in that the same sequence may 
be reproduced at will by means of proper initialization of 
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the sequence. In the strict sense of the word, such replica- 
tions are not truly random and, therefore, detract from 
modeling realistic situations such as demand which is known 
to occur at random. However, it is this property of being 
reproducible that makes pseudo-random numbers useful in a 
study which has as its main purpose the comparison of various 
techniques such as methods of demand forecasting, or the ef- 
fects of variations in the parameters of the model being stud- 
ied. To decide if one demand forecasting technique is 
superior in accuracy to another, the two techniques should be 
compared under similar circumstances, using the identical de- 
mand data. This requirement also is important when the 
effects on forecasting accuracy of changes in parameters, say 
the mean or variance of normally distributed demand, are to 
be measured or tested in some way. If differences in results 
are to be interpreted as indicating an actual difference in 
the methods or parameter effects, then the experimenter must 
insure that such simulation variations are not due simply to 
the randomness of the data generated. 

3 . Simulation Used in this Study ' 

As a basis for comparison of maximum likelihood pre- 
diction and exponential smoothing as described in Sections 
II and III, random demand data were generated, computations 
made according to the method used, and the results compared. 
The various computer routines utilized were written in For- 
tran IV and the simulation was conducted using an IBM 360/67 
computer. The method of generating the actual data for each 
specific model is discussed briefly along with the results in 



30 



the following sections. It cannot be emphasized enough that 
all of the numerical results obtained were simulation results 
and should oe interpreted as such. There is no claim made 
that the results of this thesis are true in general but only 
that they may give some idea as to properties of the methods 
tested. Tne results presented are but a small fraction of 
the total cases examined and situations studied. Some of the 
results of the study have been summarized or eliminated en- 
tirely for the sake of brevity, but all of the examples pre- 
sented are fairly typical of the general results obtained. 

B. PRELIMINARY CONSIDERATIONS 

Before presenting the simulation results a few comments 
must be made concerning what criteria should be used in judg- 
ing whether or not a forecasting technique is satisfactory 
for a particular purpose. Brown [Ref. 2] suggests the fol- 
lowing three criteria as a starting point: accuracy; simplic- 

ity of computation; flexibility to adjust the rate of response. 
Accuracy of the forecasting method is discussed in the follow- 
ing section. For the models examined in this section, flexi- 
bility of the forecasting technique is not important in that 
it is assumed that the model is good and that the underlying 
parameters remain constant over time. Simplicity of computa- 
tion is discussed in the next paragraph. 

One of the main advantages claimed by Brown for exponen- 
tial smoothing is the ease and simplicity of calculation, to 
include consideration of the actual computation process and 
the computer storage required to store the information 
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necessary to make running computations over time. By 
formulas (3—2 ) , (3-3) , and (3-7) it is easily seen that 

smoothing requires the retention of only the current value of 
S (Y) and A for the constant mean case and, in addition, the 
current value of S (Y) for the linear model. At each obser- 
vation only multiplication and addition are required to up- 
date the estimate. Smoothing advocates claim [Ref. 1] that 
complete historical records must be maintained in order to 
estimate the parameters in the model by means of maximum like- 
lihood or least squares regression in the linear case. How- 
ever, it is well known from decision theory that to estimate 
the parameters in the constant mean case when demand is 

2 n n 2 
N(vi,cr ), the statistics £ Y. and \ Y. are sufficient [Ref, 

1=1 i=l 

4] . This means that these two statistics are all that must be 
stored from one period to the next in order to update the 
estimates with each new observation. Granted, these statis- 
tics must be combined at each step with an additional opera- 
tion or two required including the taking of a square root in 
the estimation of o r but with the use of modern high speed 
computers in most large scale inventory control operations the 

extra time required for these calculations is probably negli- 

n 

gible . For the linear model the statistic £ X.Y. where X. 

i=l 11 1 

is the number of the period in which demand Y_^ is observed 
must also be maintained. The identities which express the 
formulas in Section II in terms of the above sufficient stat- 
istics are developed in Appendix A. 
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C. COMPARISON BY SIMULATION 

For simulation purposes random demand was generated in 
accordance with the model being studied. Uniform random 
deviates, 1(0,1), were first generated and, recalling the 
asymptotic normality of the sum of uniform random variables, 
normal demand was in turn generated. The random demand was 
tested by several methods to assure that it was indeed normal 
with the desired parameters. The Chi-Square test (d.f. = 9) 
was used and the resulting test statistic value of 12.8 fell 
below the critical value of 21.7 at the 1% level of signifi- 
cance. The normal demand was either used directly in the 
constant mean case or converted for the linear model by means 
of an appropriate linear transformation. 

Most of the results of the simulation are presented in the 
form of tables depicting the methods of estimation and the 
resulting sample quantities of interest. Various streams of 
random demand were used in many instances from table to table, 
but within each table the same data were used for each case 
in order to make suitable comparisons. 

1 . Simplified MLE and Smoothing in the Constant Mean 
Model . 

Since Zehna [Ref. 11] and Ornek [Ref. 6] previously 
reported on several aspects of the case where demand is 
normally distributed with constant mean and demonstrated what 
appeared to be a significant difference in the accuracy of 
estimates of the mean obtained by maximum likelihood and ex- 
ponential smoothing, it was decided to take that model as a 
point of departure for the simulation study in this thesis. 
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In this section various results of forecasting demand using 
the simplified maximum likelihood approach Eq . (2-12) and 

smoothing Eq . (3-9) are presented. 

As mentioned in the introduction, the approach taken 
was to generate random demand, make estimates of the various 
parameters after particular sampling intervals, replicate the 
experiment a number of times, and then determine the average 
values of the estimates in the particular periods and the 
variability of the estimators over the replicated experiments. 
Table IV presents these results for estimates of reorder level 
after 15, 30, 45, and 300 periods of demand observation over 
100 replications. The parameter pairs (m,ct) are of the same 
proportions as those studied by Zehna and previously indicated 
to be realistic by NavSup. The theoretical reorder level was 
determined from Eq. (2-11) with the risk level constant at 
0.05 for each parameter pair. In this case, and throughout 
the study unless otherwise indicated, the initial values for 
smoothing were chosen in order to give smoothing the best pos- 
sible starting conditions, that is, 

S 0 (Y) = and A 0 (Y) = (4-1) 

A smoothing constant value of a = 0.20 was used, the value 
presently being used by NavSup in their forecasting models. 

The entries in Table IV are the average estimate of 
reorder level and the sample standard deviation of the esti- 
mators over 100 replications for the indicated parameter 
pairs and number of periods. The results are rather striking 
and consistent. In most cases exponential smoothing tends to 
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overestimate the reorder level while MLE underestimates, a 
result noted also by Zehna. However, the most noteworthy 
aspect is the much larger variability of the smoothing esti- 
mate demonstrated by a relatively higher sample standard 
deviation. As more and more observations are taken the sample 
standard deviation of the maximum likelihood estimate decreases, 
a fact that is easily verified in the theory. However, this 
property is not demonstrated by the smoothing estimates. 

This point was suggested by Ornek [Ref. 6] and indicated by 
Eq. (3-12), In fact, after 300 periods the sample standard 
deviation of smoothing is consistently over six times as 
large as that of maximum likelihood. 

It is indicated in Table IV that for these particular 
parameter pairs the results do not depend, on the average, 
upon the various values of the parameters. For example, the 
average reorder level estimate given by MLE after 15 periods 
for the pair (10,1) is exactly 1/10 th the average estimate 
for the pair (100,10) . This result occurred in every case 
examined. This is not surprising since the random demand 
for each case was generated by the identical stream of random 
numbers, altered only by the specific parameters. For this 
reason it was decided to consider further only the parameter 
pair (100,10) as a typical case. Continuing with this ap- 
proach, Table V presents the results for the (100,10) para- 
meter pair with estimates being made after 5 to 100 periods . 

The entries are the average reorder level and sample standard 
deviation of the estimates over 100 replications. These re- 
sults are consistent with the ideas brought out by Table IV, 
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ESTIMATE OF REORDER 
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Actual Reorder Level 



TABLE V — Continued 
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Actual Reorder Level 



but some additional insight into the source of reorder level 
estimate variability may be gained by examining Table V. 

After 100 periods the sample standard deviation of R for risk 
level 0.01 is over twice as large as the same statistic for a 
risk of 0.50. This is because in the estimate at the former 
risk level, considerable weight is given to the estimate of 
a while at the latter risk level, K = 0 and the reorder level 
estimate is determined solely by jj . Thus, the weight given 
to a in the estimate decreases as the risk increases from 
0.01 to 0.50 and the corresponding variability of R decreases. 
The same general effect is true for R but to a lesser extern-. . 
This is to be expected since, from the theoretical development, 
a is roughLy twice as variable as y in the limiting case (2.3 
times for a = 0.2) . For MLE the variance of a is half that 
of y . 

To determine the effects of staring conditions, or 
initial values, on the ultimate estimate given by smoothing 
the same experiment was conducted comparing the perfect initial 
values given in Eq . (4-1) with S q (Y) = 0 and A q (Y) = 0. The 

results are listed in Table VI for various values of the 
smoothing constant a to determine how long the initial values 
effect the reorder level estimate and the variance of the 
estimate. In the Table, R1 implies the zero initial values 
and R2 indicates the best possible starting conditions. The 
approximate period in which the effect of the initial values 
become negligible is indicated as the "catch-up period. " 

One of the most interesting results of this Table is that the 
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TABLE VI 



EFFECT OF INITIAL CONDITIONS 
FOR SMOOTHING WITH VARIOUS SMOOTHING CONSTANTS 





I 


7 r-* 4- A 




Number of Periods 




a 


CUP* [ 

ll 


1. Ej L X ~ 














nator 


5 


10 


20 


40 


75 


100 






Rl 

R2 


63.76 


104 . 27 


141.13 


142 . 50 


122.68 


118.26 


0.05 


Over 

100 


2.95 

116.73 


3.58 

116.25 


3.36 

116.59 


3.95 

116.47 


2.94 
116 .47 


2.68 
116 .48 






1. 72 


1.94 


2.18 


2 . 42 


2.62 


2.57 






Rl 


106.37 


143.12 


142 . 58 


120.85 


116.60 


116.56 


0.10 


I 

100 


5.16 


5.60 


4. 82 


4.18 


3.61 


3. 88 




I 

1 


R2 


116.21 


116 .09 


116 . 73 


116 . 33 


116.48 


116.55 






3.24 


3.21 


3. 40 


3.61 


3.60 


3. 88 






Rl 


132.66 


149.51 


128.14 


116 . 70 


116 . 45 


116.55 


0.15 


65 


6.92 


7.00 


5.31 


4.65 


4.46 


4 . 88 






R2 


116.12 


115.98 


116 . 81 


116.22 


116.45 


116.55 






4.58 


4.08 


4 . 32 


4.58 


4.46 


4 . 88 






Rl 


140.98 


140.98 


123.60 


116 . 34 


116.43 


116.54 


0.175 


55 


7.70 


7.38 


5.49 


5.06 


4 . 85 


5 . 30 






R2 


116.08 


115.92 


116 . 83 


116.18 


116.43 


116.54 






5.19 


4.41 


4 . 72 


5.04 


4.85 


5.30 






Rl 


146.69 


142.92 


120 . 72 


116.21 


116 . 42 


116.52 


0.20 


45 


8.43 


7.44 


5.59 


5 . 47 


5.21 


5.69 








116.05 


115 . 88 


116.83 


116 . 17 


116 . 42 


116.52 






R2 


5 . 76 


4.68 


5.09 


5.47 


5.21 


5.69 






Rl 


150 .18 


138.15 


118.96 


116.18 


116.42 


116.50 


0.275 


40 


9 . 13 


7.50 


5.73 


5.89 


5.54 


6.05 






R2 


116.02 


115.83 


116 . 82 


116 . 17 


116.42 


116.50 






6 . 70 


4.92 


5 . 42 


5 . 89 


5.54 


6.05 






Rl 


151.83 


133.48 


117.94 


116.17 


116.40 


116.47 


0.25 


35 


9.81 


7.49 


5 . 52 


6 . 30 


5 . 86 


6 .38 






~ 


115.99 


115 .78 


116 . 80 


116.17 


116.40 


116.47 






R2 


6 . 80 


5.13 


5 . 73 


6 . 30 


5 . 86 


6 .38 



* "Catch-Up Period" 
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choice of smoothing constant a has little effect on the av- 
erage value of the reorder level estimate but the standard 
deviation of the estimate increases with increasing a. This 
is not surprising since for a stable system, as in the cur- 
rent case, the smaller values of the smoothing constant are 
clearly indicated. This choice of a causes the resulting 
estimator to be less sensitive to changes in the system, but 
have much smaller variance than for larger values of the 
smoothing constant. 

One possible measure of the accuracy of an estimator, 
at least for comparison purposes, is the mean squared error 
of the estimate. Mean squared error (MSE) may be defined as 
the sum of the square of the bias c.nd the variance. The 
theoretical MSE and average sample MSE for the (100,10) para- 
meter pair after 200 observations of demand are displayed in 
Table VII for the MLE and smoothing estimates of reorder 
level. The sample results compare favorably with the theo- 
retical values which for smoothing are approximations based 
upon the results of Section III. Not surprisingly, the re- 
sults favor the MLE procedure , due in large part to the much 
smaller variance. In fact, the variance in both cases is 
the dominating factor as the bias is negligible after 200 
periods . 

As one further illustration of the greater sample 
variance of R, reorder level for the (100,10) case was esti- 
mated using 1000 periods of demand observations and the ex- 
periment repeated 100 times. A graph of the MLE and smoothing 
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TABLE VII 



THEORETICAL AND SAMPLE M3 AN SQUARED 
ERROR IN PERIOD 200 



Theoretical Mean Squared Error 








Smoothing 


with 


Indicated a 




Risk 


MLE 


0.10 


0.15 


0.20 


0.25 


0.30 


0 . 35 


0.40 


0 .01 


1.87 ; 


21. 79 


33.74 


46.50 


60 . 10 


74.60 


90.05 


106.56 


0.05 


1.18 


13.50 


20.89 


28.75 


37.12 


46.04 


55.53 


65.65 


0.10 


0.91 


10.27 


15 . 87 


21.82 


28.16 


34 . 89 


42.05 


49.69 


0.25 


0.62 


6.65 


10.26 


14.08 


18.13 


22.43 


26.99 


31.35 


0.50 


0.50 


5.26 


8.11 


11.11 


14.29 


17.65 


21.21 


25.00 


Average Sample Mean Squared Error over 100 


Replications 








Smoothing 


with 


Indicated a 




Risk 


MLE 


0.10 


0.15 


0 . 20 


0.25 


0 . 30 


0 . 35 


0 .40 


0.01 


1.64 


22.94 


35.64 


50.55 


67.26 


84 . 72 


102.28 


119 .63 


0.05 


1.01 


14.06 


22.29 


32.01 


42.99 


54.56 


66.24 


77. 84 


0.10 


0.78 


10.57 


17.01 


24.64 


33. 25 


42.37 


51.60 


60 . 82 


0.25 


0.53 


6.62 


10 .98 


16.10 


21.82 


27.87 


34.04 


40.24 


0.50 


0 . 47 


4.99 


8.37 


12.16 


16.21 


20 . 40 


24.66 


28.99 



estimates of reorder level over the 100 replications is dis- 
played in Fig. 1. The theoretical reorder level for a risk 
of 0.05 is 116.45. Although the average values of R and R 
are not significantly different, the difference in variabil- 
ity is quite obvious. In fact, even after the 1000 periods 
the sample standard deviation of R is 5.654, while that of 
R is but 1.007. In view of many other experiments of this 
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Figure 1. Reorder Level Estimate 
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type that were conducted, it appears that for a risk level of 
0.05 and a = 0.2, a lower bound on the standard deviation of 
R is approximately a/2. No results to that effect have been 
determined analytically so the remark is included here only 
as a matter of interest. 

As a final example of results under this particular 
case, a comparison of the sample risks obtained by both meth- 
ods of estimation was made. Again with y = 100 and a = 10 
demand was generated and the reorder level estimated after 
various sampling intervals. This time the experiment was 
repeated 500 times and the risks over the 500 replications 
in the indicated periods are presented in Table VIII. In 
several of the cases the results do not differ significantly 
in one way or the other, but generally both methods tend to 
yield higher risk than desired. In the case of smoothing 
this is due to the variability of the estimate, and in the 
case of MLE the higher risk may be attributed to the tendency 
to underestimate reorder level. The risk obtained by smooth- 
ing was greater than that obtained by MLE in the majority of 
instances . 

2 . Linear Model 

In this section a study similar to that of the previ- 
ous section is presented for the linear model. The exact 
results of the maximum likelihood development of Section II 
are used for comparison with the exponentially smoothed esti- 
mates. The constant mean and zero Y-intercept cases are pre- 
sented as special cases of the general linear model. 
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TABLE VIII 



SAMPLE RISK OBTAINED BY SMOOTHING 
AND SIMPLIFIED MLE PROCEDURE 



Desired 


Esti- 




Number of 


Periods 




Risk 


mator 


10 


20 


50 


100 


150 


200 


0 


.01 


A 

R 


.044 


.016 


.018 


.014 


.006 


.010 






R 


.026 


.024 


.036 


.030 


.014 


.032 


0 


.05 


A 

R 


.078 


.076 


.062 


.064 


.046 


.060 






R 


.088 


.078 


.0 70 


.096 


.064 


.076 


0 


O 

i— 1 


A 

R 


.134 


.132 


.120 


. 134 


.086 


.108 






R 


. 120 


. 120 


. 114 


. 142 


.110 


. 128 


0 


.25 


A 

R 


.272 


.270 


. 248 


. 246 


.224 


.250 






R 


.264 


.284 


.258 


.272 


.256 


.250 


0 


.50 


A 

R 


.514 


.480 


. 504 


.494 


.500 


.482 






R 


.532 


.4 80 


. 496 


.474 


.488 


.472 



The first simulation experiment performed in this 
section was to generate random demand according to the linear 
model given by 

2 

y = a + bx + e , where e 'v, N(0,a ) 

for various values of a, b, and a. The maximum likelihood 
and smoothing estimates of the parameters in the model were 
calculated after 100 periods of demand observation and the 
results averaged over 100 replications. The estimate along 
with the sample standard deviation of the estimates are dis- 
played in Table IX. As indicated in Section III the smooth- 
ing method does not estimate the value of the intercept 
separately but rather the point on the theoretical line at 
each period. For purposes of the Table only, a^ was 
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estimated after the simulation by subtracting the value of 
b t * t from the average smoothed estimate of the point on the 
line for each period of interest, and thus no sample standard 
deviation of a fc was obtained. The results of Table IX are 



TABLE IX 

ESTIMATE OF PARAMETERS 
IN A LINEAR MODEL AFTER 200 PERIODS 



' 

Parameters 


MLE 




Smoothing 


(a,b, a) 


Sample 

Average 


Sample 

Standard 

Deviation 


Sample 

Average 


Sample 

Standard 

Deviation 


(5, .1,1) 










a 


4.84 


.095 


5.23 


* 


b 


.10 


.025 


.10 


.050 


a 


.93 


.077 


1.04 


.294 


(50, .2, 3) 










a 


50 .57 


. 378 


50.6 4 


* 


b 


.20 


.099 


.19 


.151 


0 


2 . 85 


.200 


3.13 


.882 


(100, . 3,4) 










a 


100.24 


. 698 


100.87 


* 


b 


.30 


.117 


.30 


.201 


0 


3.96 


. 347 


4.17 


1.176 


(500, .5,5) 










a 


499.68 


.540 


501.05 


* 


b 


.50 


.249 


.50 


.252 


0 


4.96 


. 377 


5.21 


1.470 



* See Narrative 



not very revealing except that once again the smoothed esti- 
mates are more variable than the corresponding maximum like- 
lihood estimates of the parameters. More interesting was 
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the fact that for many experiments using various values of 
a, b, and a the sample risks obtained by both methods were 
seen to be independent of the particular parameters in the 
model, and appear to depend only upon the number of observa- 
tions upon which the estimates are based. For this reason 
it was decided to again direct attention to only a couple of 
parameter values . 

For the sake of brevity only a few of the experiments 
conducted are reported here, in that the results are much the 
same as in the previous sections. \ T ot much insight could be 
gained from attempting to analyze the sample estimates of the 
parameters. It appeared more beneficial to examine the sample 
risks obtained by the two methods of estimation. Table X 
presents the sample risks obtained after the indicated num- 
ber of periods over 1000 replications. It should be pointed 
out that the entries in the Table are not the fraction of 
time the demand exceeded the reorder level estimates through 
50 periods for example, as was the case in the results pre- 
sented by Zehna and Ornek, but rather the percentage of times 
the actual demand exceeded the reorder- level in the (p+l)st 
period over the 1000 samples of the estimates. The reasons 
for proceeding in this manner were discussed in the introduc- 
tion. For this case the parameters (a,b,a) are (50,2,5) . 

Again the best possible initial values were used for smooth- 
ing, namely 

Sq (Y) = a - (a/3)b , and Sq (Y) = a - 2(a/B)b . 
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TABLE X 



LINEAR MODEL - ACTUAL RISK OBTAINED 
IN 1000 REPLICATIONS (a=50, b=2 , o= 5) 



Desired 

Risk 


> ' 




Number of 


Periods 






10 




20 




50 




100 




- 


/*s 




A 








R 


R 


R 


R 


R 


R 


R R 


0.01 


. 007 


.036 


.014 


.0 27 


.011 


.024 


.008 .029 


0.05 


.045 


.103 


.045 


.0 70 


.049 


.078 


.053 .068 


0 .10 


.092 


. 155 


.098 


. 12 4 


.103 


.137 


.110 .114 


0.25 


.233 


. 306 


.256 


.264 


.249 


. 274 


.256 .249 


0.50 


.482 


.516 


.521 


.519 


.510 


.536 


.505 .517 



The results here are noteworthy in that the sample risk ob- 
tained by the smoothed estimates exceeded that of MLE in 
almost every case, in the sense that the smoothing risk dif- 
fered more in one way or the other from the desired risk than 
did the risk obtained by MLE. It is felt that a sample size 
of 1000 for each period makes the results fairly significant, 
but once again it must be recognized that the results are 
sample values from simulation and should be judged accordingly. 

Since the simplified maximum likelihood procedure 
yielded good results in the constant mean case, it was de- 
cided to try a similar version for the linear case. To ap- 
ply the methods of Section II exactly, and thereby be able 
to make exact probability statements about risk, the value of 
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the t-distribution percentile must be changed with each addi- 
tional period of demand observation. As n, the total number 
of periods, becomes large, however, the value of the t- 
distribution percentiles approach those of a standard normal 
distribution, indicated by K throughout this thesis. In view 
of this, it was decided to determine what effect replacing 
the varying values of t with the corresponding limiting val- 
ue of K, always smaller, would have on the sample risk ob- 
tained by MLE . The resulting estimate is referred to as the 
K-version in this report. In Table XI this has been done for 
the general linear case with parameter triple (100,2,10). 

The Table entries are the average reorder level estimate, 
sample standard deviation of the estimates over 200 replica- 
tions, and the sample risk obtained by the indicated method 
of estimation. The desired risk levels of 0.25 and 0.50 
have been omitted primarily for the sake of brevity once 
again, but also because the values are probably too large for 
any practical application. Tables XII and XIII present the 
same information for the special cases (100,0,10) and 
(0,2,10) respectively. 

The results of these final three tables for this 
section are consistent with the earlier results. The sample 
standard deviation of the smoothed estimates is relatively 
constant over the number of periods and the corresponding 
values for both versions of MLE, although consistently larger 
in the early periods, decrease over time and are roughly a 
third as large as the deviation of the corresponding smoothed 
estimates after 100 periods . The maximum likelihood risk 
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TABLE XI 



REORDER LEVEL, SAMPLE STANDARD DEVIATION OF ESTIMATOR, 
AND SAMPLE RISK OVER 200 REPLICATIONS. LINEAR (100,2,10) 



Desired 


Esti- 


Numb 


ier of Periods 






Risk 


mator 


10 


50 


100 




0.01 


R - T 


156 .63,10 .62 


227.41, 3. 77 


326.11,2.62 








.02 


.01 


.01 






R - K 


150 .67,9.59 


226.59,3.72 


325.71,2.60 








.02 


.01 


.01 

■ 


1 

1 




R 


145. 83,8.54 


228.11,8.58 


327.60, 8.65 


! 

i 






.03 


.01 


.02 

f 


0.05 


/s. 

R - T 


144.64, 8.67 


219.74,3.34 


318.87,2.31 








.04 


.05 


.04 






A 

R - K 


142 .36,8.36 


219.42,3.3 2 


318.70,2.30 








.05 


.06 


.05 






R 


138.92,7.26 


220.51,7.24 


319.98,7.16 








.08 


.07 


.06 




0.10 


A 

R - T 


139 .19,7.97 


215 . 81,3.16 


315.07,2.18 








.08 


.08 


.13 






R - K 


137.96,7.84 


215.63,3.15 


314.99,2.18 








.09 


.09 


.13 






R 


135.27,6.67 


216.48,6.66 


315.94,6.49 








.13 


..11 


.12 
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TABLE XII 



REORDER LEVEL, SAMPLE STANDARD DEVIATION OF ESTIMATOR, 
AND SAMPLE RISK OVER 200 REPLICATIONS. LINEAR (100,0,10) 
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TABLE XIII 



REORDER LEVEL, SAMPLE STANDARD DEVIATION OF ESTIMATOR, 
AND SAMPLE RISK OVER 200 REPLICATIONS. LINEAR (0,2,10) 
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using the exact t factors is never larger than the risk 
when the corresponding values of K are used. On the basis 
of these results, it would be difficult to state that the 
modified K -version of maximum likelihood estimation of re- 
order level is significantly poorer than the actual estimator 
using t. In fact, in the earlier periods the sample standard 
deviation of the K-version is smaller than that of the 
actual MLE estimation technique. This of course is due to 
the smaller weight placed upon the estimate of o in the 
earlier periods. 
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V. BAYES PROCEDURES 



At times the manager of a supply system may have "a 
priori" degrees of belief about the values of the unknown 
parameters in the demand distribution assumed for a partic- 
ular model. In other words, he may subjectively feel, prior 
to actually observing any demand data, that certain values of 
the unknown parameters in the model are more likely than 
others. He may further wish to include his subjective feel- 
ings along with observed values of the random demand to make 
estimates of the unknown parameters. Bayesian methods may 
accomplish the above, combining the real data and subjective 
feelings into estimates which actucilly minimize the decision 
maker's risk as defined in a very special way. 

The subjective probabilities which measure the degree of 
belief discussed above determine a prior distribution for the 
parameters. The sample values of the observed demand are 
used with the prior distribution to determine a posterior 
distribution which is then used in order to obtain an esti- 
mate of the unknown parameters. The choice of a prior distri- 
bution for the unknown parameters is certainly arbitrary and 
has lead to general controversy as to the appropriateness and 
value of Bayesian methods. It is not much more arbitrary, 
however, than the selection of initial values for use with 
exponential smoothing estimation techniques. These initial 
values were shown in Section IV to effect the estimate for as 
many as 100 periods, depending upon the value of the smoothing 
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constant, another aspect of smoothing which is certainly open 
to subjectivity. Without attempting to discuss the appropri- 
ateness of Bayesian methods for the demand forecasting prob- 
lem at hand, and certainly without recommending its use 
without further examination, the concept is developed in this 
section as it applies to the model of normal demand with con- 
stant mean. The Bayes estimation results are then compared 
to exponential smoothing in a variety of situations. 

A. BAYES ESTIMATION 

One of the many limitations of Bayesian procedures is the 
extreme difficulty of obtaining joint prior and posterior 
distributions for estimating more than a single parameter in 
a model. However unrealistic it may appear, it will be as- 
sumed here that demand is normally distributed with mean 0 

2 

and known variance a . This might even be plausible in cases 
where the demand for various items of supply demonstrates 
about the same degree of variability but fluctuates about 
different and unknown mean values. In any case, it is as- 
sumed that 

~^ ( y - 0 ) 2 

f (y I 0) = — - e 2cr 

a) 2?r 

i 

where f(y|0) indicates that the distribution of Y is condi- 
tioned on the value of the parameter 0. It can be shown 

[Ref. 8] that if the prior distribution assumed for 0 is 
2 

N(0 Q ,a o ) , then the posterior distribution is also normal 
with mean 
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2 2 
a Q y + a z 0 



o 



(5-1) 



and variance 




2 2 
a °o 

c 2 + c 2 
o 



(5-2) 



Thus the mean of the posterior distribution is a linear com- 
bination of the observed demand and the mean of the prior 
distribution 0 q . For estimation purposes it can be shown 
that the mean of the posterior distribution as an estimate 
of the parameter 0 minimizes the loss as measured by squared 
error. That is, if 0 ' is an estimate of 0 and the conditional 
and prior distributions are as specified above, then (0 - 0'} 2 
is minimized by choosing 0 1 equal to the mean of the posterior 
distribution . 

It should be noted that in Eq . (5-1) the weight given to 

a 2 

the observation in the linear combination is — - ° — ^ so that 

o z + o z 

o 

if the decision maker is fairly certain that 0 Q is the true 
value of the parameter 0 or if he wishes to bias the estimate 
in favor of his prior feeling, he should choose a small vari- 
ance for the prior. If the exact value of 0 is less certain 

however, the variance of the prior should be relatively large 
2 

compared to 0 . In this way, most of the weight in the re- 

sulting estimate will be placed on the observed demand. 
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B. BAYES FORECAST MODEL 



The Bayes estimate of mean demand developed in the pre- 
ceding section may be used to forecast demand and set an ap- 
propriate reorder level, to obtain an approximate risk in a 

manner analogous to the procedures used in Section II -D. The 

o 

demand rn period one is observed and assuming a N(0 Q ,a^) prior 
distribution on G , a posterior distribution of Q given y^ may 
be determined. As previously derived, the posterior is normal 
with mean 



and variance 



0 * = 



°o y l + ° 2e o 

a O + ° 2 



°1 = 



°o a 



2 2 
a Q + a 



The estimate of reorder level then is given by 



RJ = 0f + K a 



Since information is gained by the observation y^ , the poste- 
rior distribution determined in period one is used as the prior 
distribution for period two. After observing , a new poste- 
rior distribution for 0 is determined. It is again normal 
with mean t 



0 2 




+ a 2 e 



+ O 



* 

2 



and variance 



c 



2 2 
l a 




+ 



2 * 

a 
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2 

Upon substitution for 0* and o'^ the above becomes 

°o ( yi + i’2> 4 ° 2 ° 



0 * = 



2 2 
2a + a z 
o 



„ 2„2 

r . a a 

O -i 2 O 

— and a- 2 - — ^ ^ . 



2a ' + o' 

o 



Continuing in this manner it may be shown by induction that 
after n periods the estimate and variance are 

n 



0 * = a 2 T y . + o 2 0 
n o • u , J i < 

i=l 



7 



and 



2 

a n = 



2 2 
a o a 



It follows that 



E (0* ) = 

n 



2 7 2 

n o q + a 



na 2 0 + a 2 0 
o o 

2 2 
n o Q + a 



and 



lim E(0*) = 0 

n 

n-vra 



lim a = 0 



n- 



n 



The reorder level set for the (n+1) st period is given by 



R* = e* + KO 
n n 



where K is the (l-a)th percentile of the N(0,1) distribution, 



C. SELECTION OF PARAMETERS FOR THE PRIOR — SIMULATION RESULTS 

The results of Bayes estimation depend strongly in most 

cases upon the choice of the parameters in the prior distribu- 

2 2 . . 

tion. The ratio of a Q to a determines the weight given to 

the observed demand relative to the weight placed upon the 

"a priori" best guess of mean demand 0 . To determine exactly 

how much the estimate of mean demand is effected by various 

2 

combinations of 0 and o , a limited amount of simulation was 

o o 
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conducted. For the constant mean case with 0 = 100 and 

a = 10 , random demand was generated for 100 periods. The 

Bayes estimate of 9 was computed for combinations of 0 Q = 0, 

0, 6/2, 0/3, 0/4, 9/5, and a 2 = a, a 2 /2, a 2 /3, a 2 /4, a 2 /5, 

2 2 2 2 

2a , 3a , ^a , 5a . The results of this investigation are 

summarized in Table XIV, the entries being the Bayes estimates 

of 0 after 5 and 100 periods of demand observation. Obviously 

the closer 0 is to the true value of 0 the better the result- 
o 

ing estimate is. It should be noted however, that for this 
') 2 

ratio of a''/a sufficiently large, the Bayes estimate ap- 
proaches the actual value of 0 rapidly. For example, in the 

2 2 

above case with 0 =0 and a =5 a the Bayes estimate of 

o o 

0 was 96.6 after just 5 periods and 99.5 at the end of 50 
periods . 

From the detailed data obtained it appeared that the Bayes 

estimate was fairly stable especially for a relatively good 

selection of 0 . For this reason it was decided to compare 

the Bayes estimates with the corresponding exponentially smo- 

2 

othed estimate when a is known. The results of this compar- 
ison are displayed in Table XV. For purposes of comparison 
the initial guess of mean demand was set at zero for both 
estimation techniques, that is, S q (Y) = 0 and 0^=0, and ran- 
dom demand generated with 0 = 100 and a = 10 . The mean demand 
was estimated using each method with various choices of the 

observed value weighting factors, a in the case of smoothing, 
o 

and a for the Bayesian method. The entries in the Table re- 
o 

veal that the Bayesian approach yields better estimates than 
smoothing in the early periods even for a relatively large 
smoothing constant. A choice of a >_ 2a resulted in better 

n 
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TABLE XV 



AVERAGE ESTIMATE OF MEAN DEMAND 
BY BAYES AND SMOOTHING 



0 = 100 
a = 10 


0 = 0 
s“(Y) = 0 


Number of Periods 


Estimator 


Parameter 

Values 


5 


10 


15 


50 


100 




2 

o - a 
o 


83.7 


91.0 


93.7 


97.9 


99.1 




O 2 

2 a 


91.3 


95.4 


96 . 8 


98.8 


99.5' 


BAYES 


o 














3 a 


94.1 


96 .9 


97.8 


99 . 2 


99.7 




4a 2 


95.6 


97.7 


98.3 


99 . 3 


99 . 8 




_ 2 
5a 


96.6 


98.2 


98.7 


99.4 


99 . 8 




a. = 0 . 1 


41.1 


65.1 


79.3 


99.2 


100.4 




0.2 


67.3 


89 .1 


96 . 4 


99 . 8 


100.6 


SMOOTHING 
















0. 3 


83.2 


96.9 


99.5 


99.9 


100 . 8 




0.4 


92.1 


99.0 


100 .0 


99.9 


100.9 




0.5 


96.5 


99.5 


100.3 


99.9 


100 .9 



estimates by the Bayes technique through 15 periods than the 
smoothing estimator with the usual smoothing constant 0.20. 

To determine the effect of the prior parameters on the 
sample risk obtained by the forecasting techniques the same 
model was used to generate demand for 100 periods and the 
experiment replicated 1000 times. For the case where the 
prior estimates were the best possible, that is, S Q (Y) = 0 
and 0 q = 0 , the results are presented in Table XVI. For 
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various choices of weighting factors, the desired risk was 
held constant at 0.05. The results are quite remarkable. 



TABLE XVI 

SAMPLE RISK OBTAINED BY BAYES 
AND SMOOTHING OVER 1000 REPLICATIONS 
(GOOD INITIAL VALUES) 



6 = 100 


6 = 100 
o 












a = 10 


S (Y) = 100 
o 




Number 


of Periods 




Estimator 


Parameter 

Values 


5 


13 


15 


50 


100 




2 ^ 2 
a = a 
o 


;066 


.056 


.057 


.057 


.055 


BAYES 


2a 2 


.069 


.057 


.058 


.057 


.055 




3a 2 


.072 


.058 


.058 


.057 


.055 




4a 2 


.073 


.059 


.058 


.057 


.055 




5a 2 


.074 


.059 


.058 


.057 


.055 




a = 0.1 


.060 


.051 


.057 


.057 


.063 




0.2 


.059 


.061 


. 061 


.061 


.069 


SMOOTHING 


0 . 3 


.066 


.073 


.069 


.0 70 


.079 




0.4 


.073 


.077 


.079 


.076 


.0 83 




0.5 


.082 


.0 82 


.083 


.084 


.091 



Except for a few cases the Bayes forecasting method provides 
a sample risk closer to the desired risk than the smoothing 
process . Also noteworthy is the fact that the Bayes method 
again appears much more stable than smoothing. This is in- 
deed the case as indicated by the sample standard deviation 
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of the estimates of mean demand for the exeriment displayed 
in Table XVII. For only the early periods and for small 

TABLE XVII 

SAMPLE STANDARD DEVIATION OF BAYES 
AND SMOOTHING ESTIMATORS 0.7 MEAN DEMAND 



0 = 100 
0 = 10 


0 = 100 
o 

S (Y) = 100 
o 


Number of Periods 


Estimator 


Parameter 

Values 


5 


10 


15 


50 


100 






2 

o 


3.95 


3.26 


2.84 


1.9 5 


1.69 






20 2 


4.25 


3.40 


2.94 


1.97 


1.72 




BAYES 


3a 2 


4 . 36 


3.41 


2 .94 


1.96 


1.73 






4a 2 


4.46 


3.47 


2.95 


1.95 


1.72 






5a 2 


4.48 


3.48 


2.97 


1.95 


1.72 






a = 0 . 1 


2.31 


2.6 4 


2.68 


2.61 


2 . 73 






0.2 


3.41 


3.68 


3.63 


3.60 


3.65 




SMOOTHING 


















0.3 


4.30 


4.51 


4.40 


4.49 


4.44 






0.4 


5.10 


5.27 


5.17 


5.30 


5.16 






0.5 


5.96 


6.03 


5.90 


6.06 


5 .88 





values of the smoothing constant does the smoothed estimate 
demonstrate a smaller variance than the corresponding Bayesian 
estimate. As more and more observations are made, the sample 
standard deviation of the Bayes estimate becomes smaller 
while the variance of the smoothed estimate remains about the 
same . 
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0 



The sair.e experiment was repeated with = 0 and S (Y) = 
to determine how quickly the two forecasting methods could 
demonstrate sensitivity to the actual data and obtain a sample 
risk close to the desired risk 0.05. In this case also the 
experiment was replicated 1000 times in order to obtain a 
degree of significance in the results. The sample risks 
obtained by both methods arc displayed in Table XVIII. The 

TABLE XVIII 

SAMPLE RISK OBTAINED BY BAYES 

AND SMOOTHING OVER 1000 REPLICATIONS 
(POOR INITIAL VALUES) 



e = loo 

a = 10 


e o = 0 
S q (Y) = 0 




Number 


of Periods 






Estimator 


Parameter 

Values 


5 


10 


15 


50 


100 






2 2 
a o = a 


.635 


.294 


.187 


.079 


.071 






0 2 
2a 


. 310 


.159 


.10 7 


.068 


.062 




BAYES 


, 2 
3a 


.211 


.128 


.090 


.066 


.061 






, 2 
4 a 


.168 


.109 


.084 


.065 


.061 






. 2 
5a 


. 146 


.097 


.080 


.065 


.060 






a = 0.1 


1.000 


.992 


.737 


.067 


.063 






0 . 2 


.995 


. 390 


. 145 


.061 


.069 




SMOOTHING 


0.3 


.761 


.137 


.080 


.070 


.079 






0.4 


.365 


.088 


.0 80 


.076 


.083 




• 


0.5 


.187 


.084 


.083 


.084 


.091 
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sample standard deviations of the estimators did not change 

appreciably from the previous case and are not presented. 

The results are as expected. VJhen a small value is chosen 

for the smoothing constant the long term results are fairly 

accurate, but the results in the early periods are far from 

satisfactory. If a larger value is chosen for a the smoothed 

estimate adjusts much more rapidly but gives poorer results 

in the long run due to the increased variance. The Bayes 

2 2 

estimator adjusts quite rapidly for o q = 5a and the long 
term results for all values tested are fairly accurate. 
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VI. 



CONCLUSIONS AND RECOMMENDATIONS 



Before summarizing the conclusions reached in this 
study and the recommendations which follow, it may be worth- 
while to present briefly a few of the conclusions reached by 
Zehna and Ornek in similar studies [Refs. 10, 11, and 6]. 

1. It is difficult to judge exponential smoothing on a 
theoretical basis due to a lack of knowledge concerning the 
probability distributions of the smoothing estimators. 

2. Many of the analytical results obtained by Brown 
[Ref. 2] are asymptotically valid only although applied to 
the finite demand case. 

3. The source of much of the difficulty in determining 
the distributions of the smoothing estimators is in the use 
of MAD as an estimate of variability. 

4. On the basis of simulation results the simplified MLE 
approach is to be preferred in those cases in which smoothing 
is presently employed by NavSup. 

A. CONCLUSIONS 

It has been pointed out numerous times throughout this 
thesis that the numerical results and comparisons made are 
based on computer simulation sample outcomes. However, the 
experimentation was conducted with great care taken in the 
handling of random numbers and sample size to assure a degree 
of significance in the results. The results of the various 
comparisons of MLE and exponential smoothing are quite con- 
sistent and reinforce the conclusions reached in the above 
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references* Without regressing to the specific numerical 
examples presented previously in th:.s thesis it can be con- 
cluded that on the basis of the analysis performed in this 
study the smoothing estimators are more variable than any 
form of maximum likelihood estimation and prediction method 
tested in the linear or constant mean models. The source 
of most of the variability in the smoothed estimates appears 
to be in the MAD estimate of the variance in the demand 
distribution. This results in far less consistent risks ob- 
tained by smoothing than the comparable risks obtained using 
MLE . As for the ease or difficulty of computation using the 
various methods, the exact MLE computations are certainly 
more involved but probably not to the extent of resulting in 
significantly increased computation time. In any case the 
trade-off between computational accuracy and computation time 
is a managerial decision. In the constant mean model the 
simplified MLE technique is certainly not more difficult to 
handle computationally than smoothing and has been shown to 
produce significantly more stable and accurate results. The 
Bayesian approach, while not immediately applicable without 
further study, appears to offer a procedure which, although 
subjective in nature, has a sounder theoretical basis than 
does smoothing. On the basis of simulation the Bayes method 
has been shown to produce good estimates, be relatively flex- 
ible and sensitive, and demonstrate less variability than 
exponential smoothing. 
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B. RECOMMENDATIONS 



For the reasons discussed above it is strongly recommended 
that NavSup or other agencies using smoothing for demand fore- 
casting under the assumptions of the models discussed in this 
thesis give serious consideration to at least testing other 
forecasting techniques under a variety of circumstances to 
compare and contrast the results. This testing should involve 
the use of actual historical demand data so the results ob- 
tained by the alternate forecasting techniques can be com- 
pared in retrospect with the forecasting results which were 
obtained by means of exponential smoothing. Because of its 
intuitive appeal and computational simplicity exponential 
smoothing should be investigated further with prime interest 
in discovering a method to replace MAD as an estimate of 
variance. If this can be accomplished it may turn out that 
smoothing is in some respects an optimal method of forecast- 
ing. It is also recommended that further study be made of 
the Bayesian approach developed in this thesis to determine 
its applicability in actual situations. It would be reward- 
ing to develop a Bayesian procedure which could be used to 
jointly estimate the mean and variance of the demand distri- 
bution and yet be relatively simple to apply. In cases such 
as those involving a constant proportionality between the 
mean and variance of the underlying distribution it is spec- 
ulated that an ad hoc procedure could be developed that would 
involve an "a priori" estimate of the variance and then, after 
data are observed, the use of the posterior distribution to 
estimate the variance as a function of the posterior mean. 
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Finally it is recommended that an investigation be made into 
the use of smoothing, MLE , Bayesian methods, or other fore- 
casting approaches in the case of a time dependent mean when 
demand is normally distributed but mean demand is subject to 
periodic increases or decreases. In such cases the esti- 
mating procedure must not only be able to yield accurate 
estimates but also be able to detect the changes in the mean 
of the under] ying demand distribution and adjust the fore- 
casts accordingly [Refs. 3, 7, and 9]. 
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APPENDIX A 



IDENTITIES USED IN MLE COMPyTATIONS 



As discussed in Section IV, one of the disadvantages 
often attributed to maximum likelihood estimation techniques 
applied to demand forecasting is the necessity of storing the 
historical account of demand data lor the entire period of 
interest. The following results show that the above asser- 
tion is not valid and that only certain totals are required 
to be stored much the same as with exponential smoothing. 

A. CONSTANT MEAN MODEL 



a 





but i ty. - a.) = l ~ Y) = 2. Yy " Z Y l Yy + n Y* 

i=l i=l i=l i=l 
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n n 2 

which shows that l y. and l y“T are the only data that must 

i=l i=l 

be retained from period to period. 



B. LINEAR MODEL - Y-INTERCEPT = 0 



b 



n 



i=l 



x . y . 
x J 1 



l 



X 



i=l 



2 

i 



Y # x 



n 



l (Yy - bx ) 
i=l 1 1 

n - 1 



n 



n 



n 



^ ry U A /-VH /N 0 H ^ 

but l (Y; " bx.) = I Y: - 2b [ x.y. + b I xJ 
i=l 1 i=l 1 i=l 1 1 i=l 1 



n 2 * n 



= J/i - b I, x iA 
1=1 1=1 



so that 



y • x 



n ~ n 

l y 2 - b l x.y. 
i=l 1 i=l 1 1 
n - 1 



and the only additional information which must be stored is 



V 2 

£ y i ‘ 
1=1 
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C. GENERAL LINEAR CASE 



n 



b = 



l x ± y i - nxy 
i=l 



n 



s = *i - 
1=1 



2 -2 
nx 



N 

D 



a = y - xb 



n 



y • x 



.1 (Yi - a - bx ± ) 



Nov; omitting the limits on the sums for simplicity 



n 



/V /\ 



| tYi - a - bx i > 2 = h 2 - 2a hi - 2 b I x iYi 



A O A A A ^ 

+ b £ x i + 2ab£ x^ + n a 



which becomes after substituting for a and combining terms 
hi - n y 2 - 2b(h i Y i - n xy) + b 2 (£x 2 - n x 2 ) . 

But the quantities in parenthesis are just N and D respectively 
so noting that 



D = b • N 



it follows that 



n a a o n « /n 

l (y. - a - bx ± ) Z = [ yh n y - bN 
i=l i=l 



and 



n 



I Yt - n y 2 - bN 
i=l 



y • x 



n - 2 
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Note that D depends only upon the number of periods n and 
if the demand is being followed over successive perioas of 
time the x . 1 s are just the first n integers so that the value 
of D can be stored for each period of time only and not 
calculated for each particular item of supply. 
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